Reinforcement Learning based Embodied Agents Modelling Human Users Through Interaction and Multi-Sensory Perception

نویسندگان

  • Kory Wallace Mathewson
  • Patrick M. Pilarski
چکیده

This paper extends recent work in interactive machine learning (IML) focused on effectively incorporating human feedback. We show how control and feedback signals complement each other in systems which model human reward. We demonstrate that simultaneously incorporating human control and feedback signals can improve interactive robotic systems performance on a self-mirrored movement control task where a RL-agent controlled right arm attempts to match the preprogrammed movement pattern of the left arm. We illustrate the impact of varying human feedback parameters on task performance by investigating the probability of giving feedback on each time step and the likelihood of given feedback being correct. We further illustrate that varying the temporal decay with which the agent incorporates human feedback has a significant impact on task performance. We found that smearing human feedback over time steps improves performance and we show varying the probability of feedback at each time step, and an increased likelihood of those feedbacks being ’correct’, can impact agent performance. We conclude that understanding latent variables in human feedback is crucial for learning algorithms acting in human-machine interac-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning in embodied action-perception loops through exploration

Although exploratory behaviors are ubiquitous in the animal kingdom, their computational underpinnings are still largely unknown. Behavioral Psychology has identified learning as a primary drive underlying many exploratory behaviors. Exploration is seen as a means for an animal to gather sensory data useful for reducing its ignorance about the environment. While related problems have been addre...

متن کامل

MARL-Ped: A multi-agent reinforcement learning based framework to simulate pedestrian groups

Pedestrian simulation is complex because there are different levels of behavior modeling. At the lowest level, local interactions between agents occur; at the middle level, strategic and tactical behaviors appear like overtakings or route choices; and at the highest level path-planning is necessary. The agent-based pedestrian simulators either focus on a specific level (mainly in the lower one)...

متن کامل

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...

متن کامل

Personalised Human-Robot Co-Adaptation in Instructional Settings using Reinforcement Learning

In the domain of robotic tutors, personalised tutoring has started to receive scientists’ attention, but is still relatively underexplored. Previous work using reinforcement learning (RL) has addressed personalised tutoring from the perspective of affective policy learning. In this paper we build on previous work on affective policy learning that used RL to learn what robot’s supportive behavio...

متن کامل

Autonomous Acquisition of the Meaning of Sensory States Through Sensory-Invariance Driven Action

How can artificial or natural agents autonomously gain understanding of its own internal (sensory) state? This is an important question not just for physically embodied agents but also for software agents in the information technology environment. In this paper, we investigate this issue in the context of a simple biologically motivated sensorimotor agent. We observe and acknowledge, as many ot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1701.02369  شماره 

صفحات  -

تاریخ انتشار 2016